The effect of large training set sizes on online Japanese Kanji and English cursive recognizers

Authors

  • Henry A. Rowley
  • Manish Goyal
  • John Bennett
Abstract

Much research in handwriting recognition has focused on how to improve recognizers with constrained training set sizes. This paper presents the results of training a nearest-neighbor-based online Japanese Kanji recognizer and a neural-network-based online cursive English recognizer on a wide range of training set sizes, including sizes not generally available. The experiments demonstrate that increasing the amount of training data improves accuracy, even when the recognizer’s representation power is limited.

Similar resources

The Impact of Large Training Sets on the Recognition Rate of Offline Japanese Kanji Character Classifiers

Though it is commonly agreed that increasing the training set size leads to improved recognition rates, the deficit of publicly available Japanese character pattern databases prevents us from verifying this assumption empirically for large data sets. Whereas the typical number of training samples has usually been between 100-200 patterns per category until now, newly collected databases and inc...

Effects of a Large Amount of Artificial Patterns for On-line Handwritten Japanese Character Recognition

This paper describes effects of a large amount of artificial patterns to train an on-line handwritten Japanese character recognizer. We need a huge amount of pattern samples to train recognizers to achieve high recognition performance for on-line handwritten character recognition. However, the existing pattern samples are not enough. We construct distortion models to generate a large amount of ...

Synthesis of Online Handwriting in Indian Languages

Synthesis of handwriting has a variety of applications including generation of personalized documents, study of writing styles, automatic generation of data for training recognizers, and matching of handwritten data for retrieval. Most of the existing algorithms for handwriting synthesis deal with English, where the spatial layout of the components is relatively simple, while the cursiveness o...

Semantic effects in word naming: evidence from English and Japanese Kanji.

Three experiments investigated whether reading aloud is affected by a semantic variable, imageability. The first two experiments used English, and the third experiment used Japanese Kanji as a way of testing the generality of the findings across orthographies. The results replicated the earlier findings that readers were slower and more error prone in reading low-frequency exception words when ...

Exploratory-cumulative vs. Disputational Talk on Cognitive Dependency of Translation Studies: Intermediate level students in focus

The present study set out to determine the effect of implementing exploratory-cumulative talk in comparison to disputational talk on cognitive (meaning development and organization of thought as well as problem solving ability) dependency of intermediate level students in translation studies. In order to achieve the objectives of the study, a quasi-experimental-pretest-posttest-statistical stud...

Publication date: 2002